Analysis and recoding of multimodal data
Abstract
Emotions are part of our lives and can enhance the meaning of our communication. However, communication with computers is still done by keyboard and mouse. This human-computer interaction leaves no room for emotions, whereas if we communicated with machines the way we do face to face, much information could be extracted from the context and emotion of the speaker. We propose a protocol for the construction of a multimodal database and a prototype that can be trained on this database for multimodal emotion recognition. The multimodal database consists of audio and video clips for lip reading, speech analysis, vocal affect recognition, facial expression recognition and multimodal emotion recognition. We recorded these clips in a controlled environment. The database is intended as a benchmark for current and future emotion recognition studies, so that results from different research groups can be compared. The recorded data were validated online: over 60 users scored the apex images (1,272 ratings), audio clips (201 ratings) and video clips (503 ratings) on the valence and arousal scales. Textual validation was based on Whissell’s Dictionary of Affect in Language. A comparison of the scores from all four validation methods showed clusters for some distinct emotions, but also scatter for certain emotions that depend mainly on context. Context is not always available
Similar resources
A Multimodal Discourse Analysis of Some Visual Images in the Political Rally Discourse of 2011 Electioneering Campaigns in Southwestern Nigeria
This paper presented a multimodal discourse analysis of some visual images in the political rally discourse of 2011 electioneering campaigns in Southwestern Nigeria. The data comprised purposively selected political visual artefacts from political rallies across the six Southwestern States in Nigeria (Osun, Oyo, Ondo, Ekiti, Ogun, and Lagos). The data were analyzed using Halliday’s (1985) syste...
A Critical Visual Analysis of Gender Representation of ELT Materials from a Multimodal Perspective
This content analysis study, employing a multimodal perspective and critical visual analysis, set out to analyze gender representations in the Top Notch series, one of the highly used ELT textbooks in Iran. For this purpose, six images were selected from this series and analyzed in terms of ‘representational’, ‘interactive’ and ‘compositional’ modes of meaning. The result indicated that there are...
Achieving Multimodal Cohesion during Intercultural Conversations
How do English as a lingua franca (ELF) speakers achieve multimodal cohesion on the basis of their specific interests and cultural backgrounds? From a dialogic and collaborative view of communication, this study focuses on how verbal and nonverbal modes cohere together during intercultural conversations. The data include approximately 160-minute transcribed video recordings of ELF interactions ...
Uprising in “Uprising”: A Multimodal Analysis of Bob Marley’s Lyrics
This paper investigates how the theme of uprising is conveyed in Bob Marley’s final music album, “Uprising”. Through the methodological lens of multimodality, attention is focused on how the album cover design, lexical items, literary devices, and other aesthetic features such as the titles of the ten songs of the album and their order of arrangement contribute to the overall theme of ...
The Effect of Lecture-Based and Multimodal Education on Diabetic Foot Ulcer Healing and Patients' Adherence to Care Recommendations
Background and Aim: Some patients with diabetes do not follow foot care recommendations. Methods of patient education may affect the rate of compliance and ulcer healing. The present study aimed to compare the effects of teaching by lecture and by a multimodal method on compliance with foot care recommendations and healing of diabetic foot ulcers in Kashan city during 2011. Material & Method...